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community  regarding  technical  activities  of  the  Center,  completed  and 
in  progress.  They  are  intended  to  stimulate  thinking  and  encourage 
information  exchange;  but  they  do  pot  represent,  an  approved  position 
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for  related  planning  and/or  further  action. 
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I.  INTRODUCTION 


set  of  experiinents  was  conducted  to  evaluate  the  performance 
of  the  Continuously  Variable  Slope  Delta  (CVSD)  speech  coding  algorithm. 

These  experiments  quantitatively  measured  the  performance  sensitivity 
of  CVSD  coding  with  respect  to  the  algorithm  parameters. 

The  parameters  examined  included  time  constants,  compression  ratio, 
and  filter  bandwidths.  The  performance  of  a CVSD  coder  in  terms  of 
speech  quality  and  intelligibility  depends  on  the  values  of  these 
parameters.  Performance  also  depends  to  some  extent  on  the  type  of 
implementation;  that  is,  analog  or  digital. 

In  addition  to  measuring  the  relative  sensitivity  of  the  algorithm 
to  the  basic  parameters,  these  experiments, were  also  used  to  determine 
an  appropriate  set  of  parameters  for  DCS  applications. 

It  was  found  that  CVSD  performance  is  most  sensitive  to  compression- 
ratio.  Relatively  high  values  of  compression  ratio  result  in  the  best 
performance.  In  addition,  the  results  indicate  a user  preference  for 
a filter  bandwidth  of  3 kHz  compared  with  a bandwidth  of  4 kHz.  Based  on 
these  results,  certain  parameters  have  been  selected  for  CVSD  cOcer  operation 
under  conditions  of  moderate  transmission  bit  error  rates. 


II.  BACKGROUND 


Results  from  the  Narrowband  Consortium  tests  [1]  show  that  the 
performance  of  CVSD  coders  depends  on  such  factors  as  the  condition 
of  the  input  signal  and  transmission  error  rate.  For  example,  it  has 
been  found  that  there  is  a degradation  in  speech  quality  when  speech 
synthesized  by  a linear  predictive  coding  ('PC)  terminal  is  applied 
to  a CVSD  coder.  Experimental  evidence  suggests  that  modifications 
of  the  CVSD  parameters  can  result  in  improved  performance  of  this 
LPC  tandem  configuration. 

In  a tactical  communications  environment,  there  is  a requirement 
for  CVSD  coder  operation  under  conditions  of  a 10%  transmission  error 
rate.  However,  in  the  Defense  Communications  System  (DCS),  error 
rates  are  expected  to  be  less  than  0.1%.  These  two  different  require- 
ments impose  different  constraints  on  the  CVSD  parameter  sets.  One  aim 
of  the  tests  described  here  is  to  determine  if  there  is  any  significant 
difference  in  the  performance  of  CVSD  coders  whose  parameters  are  optimized 
to  satisfy  these  different  requirements. 

The  design  of  the  CVSD  digital  voice  coder  is  illustrated  by  the 
block  diagram  shown  in  Figurd  1;  this  diagram  shows  how  the  binary 
transmission  data  and  output  speech  are  derived.  The  function  of  the 

s 

logic  circuit , is  to  generate  a pulse  whenever  three  successive 
transmission  bits  are  identi,cal.  The  basic  parameters  of  the  CVSD 
algorithm  are:  • ' 


2. 


thm 


a.  Input  and  Output  Filter  Bandwidth 

b.  Maximum  Slope, 

c.  Minimum  Slope, 

mm 

d.  Step-Size-Filter  Time  Constant, 

e.  Integrator  Time  Constant,  T2. 

The  ratio  of  maximum  slope  to  minimum  slope  is  often  called  compression 
ratio.  The  performance  of  a CVSD  coder  in  terms  of  speech  quality  and 
intelligibility  depends  on  the  values  of  these  parameters.  Performance 
also  depends  to  some  extent  on  the  type  of  implementation;  that  is, 
analog  or  digital  circuit  implementation. 


III.  TEST  DESCRIPTION 


Tests  were  performed  to  measure  quantitatively  the  performance 
sensitivity  of  the  CVSO  algorithm  with  respect  to  time  constants, 
compression  ratios,  and  filter  bandwidths.  The  Sylvania  Programmable 
Signal  Processor  (PSP)  was  used  to  simulate  the  CVSD  coder  operating 
in  real  time.  The  details  of  the  simulation  are  described  in  the 
Sylvania  final  report  [2].  A variety  of  CVSD  parameter  sets  were 
selected  and  tested  using  the  PSP  simulation.  In  addition,  a CVSD 
coder  built  by  Codex  Corporation  was  tested.  This  coder  is  a digitial 
circuit  implementation  that  was  designed  to  optimize  speech  quality 
under  conditions  of  moderate  transmission  error  rates.  The  details 
of  the  Codex  CVSD  speech  terminal  are  included  in  the  Codex  fi.ial 
report  [3]. 

In  these  tests  the  speech  coders  were  operated  at  a transmission 

rate  of  16  kb/s  with  no  transmission  errors.  This  condition  closely 

approximates  operation  in  the  DCS  where  error  rates  are  expected  to 

be  less  than  Q.1%.  The  performance  of  the  CVSO  coders  was  measured  in 

terms  of  both  intelligibility  and  quality.  The  Diagnostic  Rhyme  Test 

. ’ • 

(DRT)  was  used  to  measure  word  inteMigibility  [1^.  . The  DRT  employed 

three  speat^ers  usi-'g  one-half  of  a full  word  list;  the  speakers  consisted 

of  two  males  and  one  female.  In  the  set  of  tests  described,  the  Quality 

Acceptance  Rating  Test  (QUART)  was  used  to  measure  quality  [4],  It  is 

generally  accepted' that  a' five  point  different  between  quality  test 

scores- represents  a-. significant  difference  in  speech  quality.  All 

intelligibility  and  quality  testing,  was “accompli shed  by  Dynastat  Corp. 


IV.  RESULTS 


1.  COMPRESSION  RATiC 

Intelligibility.  In  Cf-der  to  measure  the  effect  of  compression 

ratio  on  output  speecu  . Igibility,  tests  were  performed  using 
the  -?P  wuh  four  different  compression  ratios  ranging  from  10  to 
ISO.  "ihe  remain",  j .a'-ameters  were  fixed  at  the  following  values: 

= 6.3  f;iS 
T,  = 1 0 ms 
Beir.dwidtr,  - 3 kHz 

Figure  2 is  a plot  of  DRT  ,'cores  corresponding  to  the  four  different 
compression  ratios.  It  should  be  recognized  that  DRT  scores  above 
9C  represent  highly  intelligible  systems.  This  graph  shows  that 
intelligibility  scores  generally  increase  if  the  compression  ratio  is 
increased  from  10  to  150.  The  ORT  scores  are  maximum  when  corresponding 
to  5 compression  ratio  of  150;  the  scores  are  minimum  for  compression 
ratios  in  the  range  of  10  to  20.  For  male  speakers  the  difference 
between  maximum  and  minimum  scores  is  2.9  points;  the  standard  error 
for  this  DRT  is  approximately  1.0.  For  female  speakers  the  difference 

between  maximum  and  minimum  scores  Is  5.2  points  with  a standard,  error 
of  approximately  1.0.  These  results  indicate  a measurable  and^ 

statistically  significant  improvement  in  the  intelligibility  of  speech 
processed  by  a CVSD  coder  if  the-  compression  ratio  is.  increased  from  10  to 
150.  Additional  tests  with  a compression  ratio  of  200  did  not  indicate 
much  improvement  in  performance  -compared  with  'a  compression  .ratio  of  150;, 
also  with  a compression  ratio^of  EdO^perfortnance  was  degraded  under 
conditions  of  transmission' errors. 
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b.  Quality.  Qua! tty  measurements  were  accomplished  using  the 
QUART  which  assigns  a numerical  rating  to  predict  user  preference  for 
communications  systems.  Figure  3 is  a plot  of  QUART  scores  corresponding 
to  four  different  compression  ratios.  The  standard  error  of  these 
quality  scores  is  approximately  2.5.  The  difference  between  the 
maximum  and  minimum  QUART  scores  is  10.1  points.  These  results  indicate 
that  quality  scores  increase  monotonically  with  compression  ratio  and 
that  there  is  a significant  user  preference  for  the  highest  value  of 
compression  ratio.  Figure  4 shows  the  relation  between  quality  scores 
and  the  percentage  of  people  who  find  the  communications  acceptable  for 
routine  use. 

2.  FILTER  BANDWIDTH 

a.  Intelligibility.  The  effects  of  changing  the  input  and  output 

filter  bandwidths  were  measured.  Table  I lists  the  DRT  scores 

corresponding  to  filter  bandwidths  of  approximately  3 kHz  and  4 kHz; 

these  are  the  cutoff  frequencies  of  the  low  pass  output  and  input  filters 

which  are  seventh  order  elliptic  filters.  The  standard  error  of  each 

DRT  is  shown  to  the  right  of  the  score.  Although  the  intelligibility 

score  for  the  female  speaker  is  unchanged,  the  score  for  the  male 

" ' ♦ 

speakers  is  slightly  higher,  corresponding  to  the  increased  bandwidth. 
With  a compression  ratio  of  150,  the  DRT  scores  are  increased  slightly 
for  male  speakers  and  decreased  for  female  speakers,  corresponding  to 
the  increased  bandwidfh.  These  results  indicate  that  there  is  no 
significant  overall  difference  in  DRT  scores  corresponding  to  the  two 
filter  bandwidths. 


^8- 


Figure  4.  Relationship  of  Quality  Scores  to  Acceptability 


b.  Qual ity.  The  results  of  the  quality  tests  are  also  listed 
in  Table  I.  These  results  indicate  a rather  significant  user  preference 
for  the  3 kHz  bandwidth  cotppared  with  the  4 kHz  bandwidth.  The 
improvement  in  the  QUART  score  is  approximately  5 points  when  the  cutoff 
frequency  is  reduced  from  4 kHz  to  3 kHz. 

3.  TIME  CONSTANTS 

Tests  were  performed  to  measure  the  performance  sensitivity  of 
the  CVSO  algorithm  with  respect  to  the  time  constants  of  the  integrator 
and  step-size-filter.  For  these  tests  the  following  parameters  were 


Compression  ratio  « 10 

Filter  Bandwidth  « 3 kHz. 

a.  Step-Size-Filter.  Table  II  contains  DRT  scores  corresponding 
to  two  different  values  of  T.| . These  results  show  that  intelligibility 
is  generally  less  for  the  time  constant  of  20  ms.  Also  shown  in  Table  II 
are  the  quality  scores  corresponding  to  the  two  different  values  of  T^ . 
The  quality  scores  indicate  a general  reduction  in  user  preference  for 
the  higher  value  of  T.j . 

b.  Integrator.  Table  II  also  lists  the  DRT  .and  QUART  scores 
corresponding  to  two  different  values  of  the  integrator  time  constant, 

Tg.  These  results  suggest  that  if  the  integrator  time  constant  is 
varied  within  the  limits  of  1 ms  to  20  ms  there  is  negligible  change 
in  intelligibility  ailti  user  preference. 

4.  COMPARISON  OF  CVSD  CODERS 

The  CVSD  coder  proposed  for  use  ,jn  the  TRI-TAC  secure  voice  system 
is  based  on  the  following  set'of  parameters: 


/ 
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Filter  Bandvn'dth  = 4 kHz 
Compression  Ratio  = 12 
= 6.3  ms 
T2  = 1 .0  ms. 

This  particular  algorithm  (CVSD-A)  was  simulated  on  the  PSP.  The 
corresponding  DRT  and  QUART  scores  are  listed  in  Table  III.  The  CVSD 
coder  built  by  Codex  Corporation  is  based  on  the  following  set  of 
parameters  (CVSD-B): 

Filter  Bandwidth  - 3 kHz 
Compression  Ratio  = 166 
T.|  = 2 ms 
T2  = 8 ms. 

DRT  and  QUART  scores  corresponding  to  the  Codex  terminal  are  listed 
in  Table  III.  Also  included  in  Table  III  are  the  scores  corresponding 
to  a PSP  simulation  of  a version  of  CVSD-B.  The  results  indicate  a 
strong  user  preference  for  CVSD-B  over  CVSD-A. 


V.  CONCLUSIONS 

Speech  intelligibility  and  quality  of  CVSD  coders  are  sensitive 
to  the  values  of  certain  algorithm  parameters.  In  particular,  it 
was  found  that  CVSD  performance  is  most  sensitive  to  compression  ratio; 
the  highest  value  of  compression  ratio  resulted  in  the  best  performance. 
In  addition,  the  quality  scores  indicated  a significant  user  preference 
for  a filter  bandiwdth  of  3 kHz. 

Results  have  also  shown  that  within  the  ranges  tested,  the  time 
constants  of  the  integrator  and  step-size-filter  have  small  effects  on 
quality  and  intelligibility.  However,  these  time  constants  do  affect 
performance  under  conditions  of  high  transmissions  error  rates. 

Therefore  the  following  time  constants,  which  have  been  selected  for 
such  conditions,  are  appropriate: 

= 6.3  ms 
= 1.0  ms 

Although  the  tests  performed  were  by  no  means  exhaustive,  the 
results  can  be  used  to  quantify  the  performance  of  CVSD  coders 
corresponding  to  different  parameter  sets.  The  test  results  shpw  that 
the  CVSD-B  parameters  provide  the  best  performance  compared  with  the 
other' parameter  sets  including  the  CVSD-A  parameters.  The  quality  tests 
indicated  the  greatest  preference  for  the  Codex  CVSD  coder.  The  test 
results  also  indicate  that  the  ‘following  CVSD  parameters  should 
provide  performance  comparable  with  CVSD-B. 


{ Filter  Bandwidth  = 3 kHz  J 

■;  Compression  ratio  =150 

I i 

= 6.3  ms  I 

T.,  = 1 .0  ms  ; 
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